Visual Clustering Approaches
Authors
Abstract
The volume of data stored worldwide is growing rapidly. This growth of databases has far outpaced the human ability to interpret the data, giving rise to the phenomenon of big data. Big data is difficult to handle with most existing tools, automatic methods, and visualization techniques. New methods known as Visual Data Mining have recently appeared; they aim to involve the user more deeply in the data mining process and to make more intensive use of visualization. We think it is important to take user perception into account, for example to guide the dimension selection process, to select the initial seeds of a clustering algorithm, and to select clusters interactively. Inspired by these ideas, we propose a semi-interactive algorithm (Fig. 1 [1]) that integrates an automatic algorithm, an interactive evolutionary algorithm, and visualization tools. This approach can be applied to parsimonious clustering, where clusters may overlap and the relations between overlapping clusters can be detected interactively ([2], Fig. 2). We also propose an interactive clustering approach in which clusters are identified visually by combining different projections of interactive visualizations. In the many situations where visual perception is more effective than classical clustering methods, the proposed approach gives better results ([3], Fig. 3). These techniques can be combined with a new iterative clustering approach that extracts compact clusters one by one. Here visualization is very important: the user can stop or continue the process according to the information obtained [4]. These approaches can be applied to various big data problems: data streams, social networks [5], …
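As an illustration only, the one-by-one extraction of compact clusters mentioned in the abstract might be sketched as follows. The abstract does not specify the procedure, so everything here is an assumption: the density rule (take the point with the most neighbours within a fixed radius), the names `extract_compact_clusters`, `radius` and `min_size`, and the `keep_going` callback, which stands in for the user's decision to stop or continue after inspecting a visualization. It is not the authors' method.

```python
import math
from typing import Callable, List, Sequence, Tuple

Point = Tuple[float, ...]


def extract_compact_clusters(
    points: Sequence[Point],
    radius: float,
    keep_going: Callable[[List[List[Point]], List[Point]], bool],
    min_size: int = 2,
) -> Tuple[List[List[Point]], List[Point]]:
    """Peel off one compact cluster at a time (hypothetical sketch).

    keep_going(clusters, remaining) plays the role of the user: it is
    called after each extraction and returns False to stop the process.
    """
    remaining = list(points)
    clusters: List[List[Point]] = []
    while remaining:
        # Assumed density rule: pick the point with the most neighbours
        # (itself included) inside the given radius.
        centre = max(
            remaining,
            key=lambda p: sum(math.dist(p, q) <= radius for q in remaining),
        )
        cluster = [q for q in remaining if math.dist(centre, q) <= radius]
        if len(cluster) < min_size:
            break  # nothing compact enough is left
        clusters.append(cluster)
        remaining = [q for q in remaining if math.dist(centre, q) > radius]
        if not keep_going(clusters, remaining):
            break  # the "user" chose to stop
    return clusters, remaining


# Example data: two tight groups and one isolated point.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (9.0, 9.0)]
clusters, leftover = extract_compact_clusters(points, 0.5, lambda c, r: True)
```

In an interactive setting, `keep_going` would display the clusters found so far together with the remaining points and return the user's choice; a constant callback is used above only to keep the sketch self-contained.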
Similar resources
Application of Clustering Methods in DNA Microarrays
Background: DNA microarray technology has paved the way for investigators to measure the expression of thousands of genes in a short time. Analysis of this large amount of raw data includes normalization, clustering and classification. The present study surveys the application of clustering techniques in DNA microarray analysis. Materials and methods: We analyzed data from the Van’t Veer et al. study dealing with BRCA1...
Signal processing approaches as novel tools for the clustering of N-acetyl-β-D-glucosaminidases
Nowadays, the clustering of proteins, and of enzymes in particular, is one of the most popular topics in bioinformatics. An increasing number of chitinase genes from different organisms, and their sequences, have been identified. So far, various mathematical algorithms have been used for clustering chitinase genes, but most of them seem confusing and sometimes insufficient. In the...
Evaluating Different Approaches to Permeability Prediction in a Carbonate Reservoir
Permeability can be measured directly in the laboratory using cores taken from the reservoir. Due to the high cost associated with coring, cores are available for only a limited number of wells in a field. Many empirical models, statistical methods, and intelligent techniques have been suggested to predict permeability in un-cored wells from easy-to-obtain and frequent data such as wireline logs. The main obj...
Proposing a Novel Cost Sensitive Imbalanced Classification Method based on Hybrid of New Fuzzy Cost Assigning Approaches, Fuzzy Clustering and Evolutionary Algorithms
In this paper, a new hybrid methodology is introduced to design a cost-sensitive fuzzy rule-based classification system. A novel cost metric is proposed based on the combination of three different concepts: Entropy, Gini index and DKM criterion. In order to calculate the effective cost of patterns, a hybrid of fuzzy c-means clustering and particle swarm optimization algorithm is utilized. This ...
Software Refactoring Approaches: A Survey
The objective of software refactoring is to improve the software product’s quality by improving its performance and understandability. There are also different quality attributes that software refactoring can improve. This study gives a wide overview of five primary approaches to software refactoring. These are two clustering approaches at class level and two at package level, as well as one gr...
A Visual Framework Invites Human into the Clustering Process
Clustering is a technique commonly used in scientific research. The task of clustering inevitably involves human participation: the clustering is not finished when the computer/algorithm finishes, but when the user has evaluated, understood and accepted the patterns. This defines a human-involved “clustering analysis/evaluation” iteration. Instead of neglecting this human involvement, we provide a vi...
Publication date: 2013